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Abstract 



We show that the widely used homotopy method for solving fixpoint problems, as well as the 
Harsanyi-Selten equilibrium selection process for games, are PSPACE-complete to implement. Extend- 
ing our result for the Harsanyi-Selten process, we show that several other homotopy-based algorithms 
for finding equilibria of games are also PSPACE-complete to implement. A further application of our 
techniques yields the result that it is PSPACE-complete to compute any of the equilibria that could be 
found via the classical Lemke-Howson algorithm, a complexity-theoretic strengthening of the result 
in ll24l . These results show that our techniques can be widely applied and suggest that the PSPACE- 
completeness of implementing homotopy methods is a general principle. 
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1 Introduction 



According to Roger Myerson ll20i . the 1950 publication of Nash's paper on equilibria was a watershed event 
not just for Game Theory, but for Economics in general. The new general equilibrium concept, and its estab- 
lished universality, was an impetus for understanding rationality in much more general economic contexts, 
and inspired the important price equilibrium results by Arrow and Debreu. Myerson argues convincingly in 
|f20J that the concept of Nash equilibrium lies at the foundations of modern economic thought. 

Seen from an algorithmic perspective, however, the Nash equilibrium suffers from two important prob- 
lems: First, it is not clear how to find it efficiently (the same is true for the Arrow-Debreu variety for markets 
and prices). This shortcoming had already been identified by economists since the 1950s, and much effort 
has been devoted to algorithms for finding Nash equilibria, see [251 [T9l [14] for examples from a very ex- 
tensive literature. None of these algorithms came with polynomial-time guarantees, however, and the recent 
result |l6j HI establishing that the problem is PPAD-complete explains why. Of the many algorithmic ap- 
proaches proposed by economists over the past 50 years for finding Nash equilibria, most have been shown 
by now to require exponential time in the worst case 1241 [T6ll . One exception is an important algorithmic 
genre known as homotopy methods Q ; see llTSl for a recent survey. 

In topology, a homotopy is a continuous transformation from one function to another (as, for example, 
between two paths joining two points on a map). The homotopy method starts with a fixpoint problem that 
is easy to solve (say, a rotation of a disc around its center), and continuously transforms it into the problem 
in hand, by "pivoting" to new fixpoints along the way. A theorem by Browder 121 establishes the validity of 
this method in the limit, by showing the existence of a continuous path of fixpoints that joins two fixpoints 
of the initial and the final problems. 

The second algorithmic obstacle for the Nash equilibrium concept is multiplicity. Games have multiple 
equilibria, and markets many price equilibria, and thus the corresponding equilibrium concepts are only 
nondeterministic predictions (oxymoron intended). In price equilibria, this multiplicity has been blamed 
for economic crises: The path guaranteed by Browder's theorem is non-mono tonic, going back and forth 
in time. As a result, equilibria vanish at its folds, leaving the market in turmoil [1]. In games, a proposed 
remedy for multiplicity is the so-called focal point theory see, e.g., ifTSl p. 414, postulating that players 
implicitly coordinate their equilibrium choice by focusing on the most obvious, or mutually advantageous, 
equilibrium; repeated play and learning (see, e.g., [9]) can also be considered a remedy for multiplicity. In 
1975, Harsanyi proposed the tracing procedure [10] for battling equilibrium multiplicity, a theory further 
explicated in his joint 1988 book "A General Theory of Equilibrium Selection in Games" with Selten [ 1 1 1 
(Harsanyi and Selten shared in 1994 the Nobel prize with Nash). The tracing procedure asserts that players 
engaged in a game Q play at first a simple game Qq, in which their prior beliefs about the other players' 
behavior result in a dominant strategy. As time t progresses, and their priors are falsified by life, they play 
a more and more realistic game = (1 — t)- Qq + t- Q, until, at time t = 1, they end up playing the 
intended game Q. They show that, for almost all games, tracing the equilibrium path of this process results 
in a unique equilibrium. Notice the parallel with the homotopy method; apparently the two were discovered 
independently. 

Our results. This paper is a complexity-theoretic critique of the tracing procedure and the homotopy 
method: we show that finding the solutions they prescribe requires the power of PSPACE. In particular, 
finding the Brouwer fixpoint that would have been discovered by the homotopy method, for a simple start- 
ing function and an adversarial final one, is PSPACE-complete. The same is true, via standard reductions, 
for price equilibria. We also construct examples where the homotopy method not only will undergo an ex- 
ponential number of pivots (this was expected since |[T6]| ). but will suffer an exponential number of direction 
reversals. As for the tracing procedure, we show that it is PSPACE-complete to find the Nash equilibrium 
selected by it, even in two-player games, and even if the initial game has dominant strategies obtained from 
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priors, exactly as prescribed by Harsanyi and Selten. We extend this result to homotopy-based algorithms 
where the starting game depends on the final game and show that it is PSPACE-complete to implement 
the Herings-van den Elzen, Herings-Peeters, and van den Elzen-Talman algorithms for finding equilibria in 
games. Finally, it is particularly noteworthy that PSPACE-completeness prevails even for finding the solu- 
tions that would be returned by the classical Lemke-Howson algorithm, a simplex-like method that had long 
been considered an oasis of conceptual simplicity and (until Il24l ) of algorithmic hope in this field. This 
reinforces the "exponentially long paths" result of |24| with a new result which says that, subject only to 
the hardness of PSPACE, no short cuts to Lemke-Howson solutions are possible (for any of the different 
initial choices of the algorithm). Since it is known that the Lemke-Howson algorithm can be expressed as a 
homotopy fTSl , this result can also be seen as a powerful specialization of our first result. 

The algorithms we consider solve problems in the complexity class PPAD, which is contained in TFNP, 
the class of all total function problems in NP. Another prominent complexity class contained in TFNP 
is PLS (for polynomial local search). Many common problems in PLS (e.g., local max cut and finding 
pure equilibria of congestion games) are complete under a so-called tight PLS -reduction, implying that the 
corresponding standard local search algorithm is exponential (for certain starting configurations and any 
choices of the local search algorithm). Furthermore, one can conclude that the computational problem of 
finding a local optimum reachable from a given starting configuration by local search is PSPACE-complete. 

No such concept of tight reductions is known for PPAD, and our results can be seen as addressing this 
deficiency. Specifically, we show the PSPACE-completeness (and exponential worst-case behaviour) of a 
number of homotopy-based algorithms for finding equilibria. Our reductions start with the problem Other 
END OF THIS LINE (Oeotl), which is related to the problem End of the line used in the definition is 
PPAD, seeking not just any end of a path, but the other end of the particular path starting at the origin. 
Oeotl was known to be PSPACE-complete since 1,23 J . but this fact has so far remained unexploited for 
proving lower bounds for other problems. 



Outline of the paper. In Section 2.1 we give an overview of the linear homotopy method as applied to 
Brouwer functions and games. In Section 2.2 we recall the PSPACE-complete problem Oeotl (Other 
End of this Line), which serves as the starting point for all our main reductions. In Section |3} we show 
that the linear homotopy method to compute a Brouwer fixpoint is PSPACE-complete, which is proved in 
Section 3.3 In Section [4j we establish the PSPACE-completeness of the Unear tracing procedure for two- 
player strategic form games for a special starting game that is independent of the final game. These results 
are extended to starting games that depend on the final game in Section[5} where we show that it is PSPACE- 
complete to implement the Herings-van den Elzen, Herings-Peeters, and van den Elzen-Talman algorithms 
for computing equilibria of games. The techniques of ||4l0 are central to both Section|3]and Section|4]and 
are recalled and extended along the way. Finally, in Section [6} we show that it is PSPACE-complete to find 
any solution of a two-player game by the Lemke-Howson algorithm. 



2 Preliminaries 
2.1 Homotopies 

A Brouwer function is a continuous function from a convex and compact domain D to itself; by Brouwer's 
fixpoint theorem there exists x ^ D such that J^{x) = x. A homotopy between two functions : X — Y 
and Ti : X — > Y (where X and Y are topological spaces) is a continuous function H : [0,1] x X — > Y 
such that for all x £ X, H{0,x) = To{x) and H{l,x) = Fi{x). In this paper, we are interested in the 
special case where X = Y = D,for D n closed compact subset of Euclidean space, such as a cube. Thus, 
Fq and Fi are Brouwer functions on D. Given two continuous functions Fq,Fi : D — D, the linear 
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homotopy is given by the expression H{t, x) = {1 — t)TQ{x) + tJ^i(x), and (if D is convex) results in a 
continuum of Brouwer functions '■ D — > D given by J^i = (1 — t)- J13 + J^i for i G [0,1]. 

Browder's fixpoint theorem p2ll (not to be confused with Brouwer's fixpoint theorem) asserts that given 
a homotopy connecting Tq and Ti, there is a path in [0, 1] x D from some fixpoint of To to some fixpoint 
of Ti, such that for every point {t, x) on that path, 2; is a fixpoint of J^t- The homotopy method JT) [151 for 
finding a fixpoint of Ti selects Tq to have a unique and easy to find fixpoint, and essentially follows such a 
path. As noted in [ 15 1, we do not expect the path to be monotonic in t — indeed, we show in Section [8] that 
an exponential number of direction reversals is possible. 

We are often interested in approximate fixpoint^ If is a Brouwer function, an e-approximate fixpoint 
is a point x such that \F{x) — x| < e (we shall use the Loo nietric throughout). It follows from Browder's 
theorem that, for any Fq,Fi, there is a finite sequence xq, xt^, . . . , x^^, xi of e-approximate fixpoints of 
Fq, Tt^ , . . . , Tty, , T\, for some k and ti,. . . ,tk, such that any two consecutive fixpoints in the sequence are 
at most e apart. 

We shall be interested in the following problem, which we call Browder FIXPOINT: Given two arith- 
metic circuits computing two functions To and from [0, l]'^ to itself with Lipschitz constant ^, an e > 0, 
where J^q has a unique fixpoint xq, find an e-approximate fixpoint xi of J^i that is connected via a sequence 
of e-approximate fixpoints to xq. (To make this definition precise, we of course have to identify classes of 
functions from which Tq and Ti may be drawn.) Notice that the homotopy method for computing Brouwer 
fixpoints provides a solution to this problem. 

Homotopies can be defined very similarly also for games. Given two games Qq, Qi of the same type 
(number of players and strategies), we consider Qt = (1 — t)- t/o + ^" ^1. where it is the players' utilities that 
are interpolated. It is routine to extend this definition to more general classes of games, such as graphical 
games [17] (in which case, in addition to the players and strategies, the two graphs must be the same). 
Browder's theorem, via Nash's reduction, establishes that there is a path of approximate Nash equilibria 
here as well. The problem LINEAR TRACING is the following: Given two games and Qi, an e > 0, 
and a Nash equilibrium xq of Qq, find an e-approximate Nash equilibrium xi of Qi that is connected via a 
sequence of e-approximate Nash equilibria to xq. 

It is easy to see that Linear tracing is in PSPACE, and it can be checked that the algorithm of Herings 
and van den Elzen |[T3l achieves this(see Section [T!!]). Browder Fixpoint is also in PSPACE. 



2.2 Other End Of This Line 

We consider directed graphs on 2" vertices represented as n-bit vectors. The arcs are represented by two 
polynomial-size circuits S and P, each having n inputs and outputs, as follows. There is an arc from vertex 
vtow provided that S{v) = w and P{w) = v. Notice that all vertices of the graph have both indegree and 
outdegree or 1, that is, the graph consists of paths, cycles, and isolated vertices. 

Definition 1 An (S, P)- graph with parameter n is a graph on {0,1}" specified by circuits S and P, as 
described above, subject to the constraint that vertex 0" has no incoming arc but does have an outgoing arc. 

The problem End of the line is the problem of finding a vertex of a given {S, P)-graph other than 
0" which has at most one incident arc. Note that this problem is in the class TFNP of total search problems 
in NP: there exists a solution that could be obtained by following the directed path that starts at 0", and 
any given solution may be efficiently checked for correctness. The class PPAD |[23l is defined as all search 
problems polynomial-time reducible to End OF THE LINE. The problem Other end of this line (which 

' A very interesting alternative consideration [ 8 1 focuses on exact fixpoints, resulting in higher complexity of the search problem; 
here we could also consider exact fixpoints and equilibria without much effect on our results, since we are dealing with PSPACE- 
completeness. It is known from (8) that this harder problem belongs to PSPACE. 
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we will subsequently abbreviate to Oeotl) is the problem of finding the end of the particular path that starts 
at 0". In contrast with End of the line, a given solution to an instance of Oeotl has no obvious concise 
certificate that it is the correct endpoint, so while Oeotl is a total search problem, it is apparently not an 
NP total search problem. In fact, we have the following (Theorem 2 of 1,23,1 ). which is the starting-point of 
our reductions. 

Theorem 1 /Eil/ Oeotl is PSPACE-complete. 
2.3 Our Approach 

In 161, each instance / of End of the line is reduced to a game Qj in such a way that any Nash equilibrium 
of Qi efficiently encodes a solution to /. Here we reduce / to a homotopy problem defined by two games, 
(^0, Gj), where Qq depends only on n, while Qj encodes the circuits in /, and is an extension of Qj of f6|. We 
establish that the associated linear homotopy corresponds to the naive "follow the line" approach to solving 
Oeotl; technically, a suffix of the homotopy path corresponds to following the line, and the existence of the 
relevant suffix is established in a non-constructive way, using the intermediate value theorem. In extending 
the result to the Lemke-Howson algorithm, the main technical obstacle is the initial choice of which "label 
to drop", leading to multiple disjoint paths in the mixed-strategy profile space. We have to ensure that all 
of the 2n solutions (one for each pure strategy) efficiently encode the solution to /, where / is treated as 
an instance of Oeotl. This is done by embedding two copies of the game Qj inside a larger one in such a 
way that at least one copy does not contain the initially-dropped label, and arguing that any Lemke-Howson 
equilibrium restricted to this copy ends up encoding the unique solution to /. 



3 The Homotopy Method for Brouwer Fixpoints 

In this section we give detailed definitions of classes of fixpoint and approximate fixpoint computation prob- 
lems. In Section [37T] we review the definition of Brouwer-mapping functions — and related concepts — from 



Chen et al. [4J, here applied to a three dimensional domain. In Section 3.2 we review the techniques of [6, 4] 
for implementing Brouwer-mapping functions as arithmetic circuits. In Section 3.3 we prove Theorem [3j 
the main result of Section |3] in which we establish the PSPACE-completeness of a linear homotopy for 
finding a fixpoint of a Brouwer function, n G N will denote a complexity parameter of problem instances. 
We define a sequence Tq""^ of "basic Brouwer functions" having unique known fixpoints. For each n we 
define a class of Brouwer functions whose members encode {S, P) -graphs on {0,1}". The homotopy of 
Equation ([T]) defines a class of functions J^t, t G [0, 1], that interpolate between and and specifies 
a particular fixpoint of Fi. We will show that from that fixpoint, we can efficiently recover a solution to 
Oeotl for the graph encoded by Fi. 

3.1 Definitions and notation 

Notation 1 Let K be the unit 3-D cube [0, 1]'^. For n ^ K let /C^") denote a partition of K into 2^" 
"cubelets", /C^") = {Kijk : < i, j, < 2" - 1}; Kijk is an axis-aligned cube of length 2 " whose 
vertex closest to the origin has coordinates 2~"(z, j, k). 

We define a Brouwer-mapping circuit in a similar way to the definition in lH, here specialized to the 
case of 3 dimensions. We also introduce some variations of the definition, as follows: 

Definition 2 (Brouwer-mapping circuit/function; basic Brouwer-mapping function; DGP-style Brouwer- 
mapping function; partial Brouwer-mapping function) 
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A Brouwer-mapping circuit (bmc) is a directed boolean circuit with 3n input nodes and 2 output nodes. 
Note that any bmc B has an associated Brouwer-mapping function (bmf) Jb ■ /C^"^ — > {0, 1, 2, 3} that 
maps any cubelet Kijk to one of the four colors {0, 1, 2, 3}. We require the colors of all exterior cubelets 
to be predetermined as follows. Fori = 0, fB{Kijk) = 1- For j = 0, i > 0, fsiKijk) = 2. Fork = 0, 
i,j > 0, fB{Kijk) = 3. All other exterior cubelets are mapped to 0. 

The basic bmf /q"^ : /C^"^ — > {0, 1, 2, 3} has the additional property that all internal cubelets get 

(n) 

mapped to 0. Notice that /q is computable by a bmc of size polynomial in n. 

A DGP-style bmf is one that is derived from an {S, P)-graph in the manner of [6], and so is computable 
with a bmc of size polynomial in the size of circuits S and P. (Proposition^notes the relevant property of 
DGP-style bmf's.) 

A partial bmf / is defined with respect to a set S Q /C^"); / assigns a color to elements of S but f may 
be undefined on non-elements ofS. 

Proposition 1 The following problem is VSVAC^-complete. Given a Brouwer-mapping circuit B, find a 
point in K that is a vertex of 4 cubelets mapped to all 4 colors by the associated bmf fs, and which is 
connected to the origin via cubelets having colors other than 0. 

This is a total search problem: the topological intuition is that there is a line that is adjacent to the colors 
{1, 2, 3} and has one end at 2~"(1, 1, 0). The other end must be inside K and adjacent to color 0, since 
no other exterior point is adjacent to the colors {1,2,3}. We note in passing that if we did not make the 
"connected to the origin" requirement, the problem would be PPAD-complete. 

Proof. We reduce from Oeotl (Theorem[T]l: Let G be an {S, P)-graph. Let Jb be a DGP-style bmf derived 
from G, whose circuit B efficiently encodes G. 

Given a bmf Jb, define a {1, 2, 3}-chromatic vertex to be one that is shared by 3 cubelets with colors 1, 
2 and 3. By construction, the only exterior {1, 2, 3}-chromatic vertex for any bmf is 2^"(1, 1, 0). Form a 
digraph Gb on {1, 2, 3}-chromatic vertices by adding an arc between any pair that share a cubelet, directed 
such that if it pointing away from a viewer, its adjacent colors 1, 2, 3 will appear in clockwise order around 
it. The reduction of 161 ensures G b has indegree/outdegree at most 1 . 

Define a panchromatic vertex to be one that belongs to 4 cubelets of all 4 different colors. By con- 
struction, for all bmf's there is a path of {1, 2, 3}-chromatic vertices starting at 2~"(1, 1, 0) and ending at a 
unique panchromatic vertex Vend- Vend is a solution; it can be found in polynomial space by following this 
path. 

Let Gb be the graph on /^'s {1, 2, 3}-chromatic vertices as described above. The reduction of ||6l has 
the following properties, from which the result follows. 

1. Each vertex v of G has an associated {1, 2, 3} -chromatic vertex b{v) of Gb', v and b{v) may be 
computed in polynomial time from each other. For v = 0", b{v) = 2~"'{1, 1, 0). 

2. f is a solution to End of the line if and only if b{v) is panchromatic. 

3. Each arc {v, w) of G corresponds to a sequence of edges of Gb that connect b{v) to b{w). 

4. Each connected component of cubelets colored with {1,2,3}, corresponds to a connected component 
of G. 

□ 
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3.2 Implementing Brouwer-mapping functions as arithmetic circuits 

We review a class of functions used to establish PPAD-completeness of graphical and strategic-form games. 
Recall that K denotes the 3-dimensional unit cube; we consider continuous functions T : K — > K having 
the following structure. Each function is an arithmetic circuit composed of nodes, with each node taking 
inputs from up to 2 other nodes, and producing an output, for example, the sum of its inputs. All values are 
constrained to [0, 1], so a node that adds its inputs would output 1 if their sum is greater than 1. Identify 3 
nodes as "input nodes" and another 3 as "output nodes", so if is a continuous function from K to K, it 
has a Brouwer fixpoint. 

Definition 3 A linear arithmetic circuit is an arithmetic circuit that computes a function from K to K, 
represented by a directed graph whose nodes are "gates " that perform certain basic arithmetic operations 
on their inputs as follows. Each gate takes as input 0, 1 or 2 real values in [0, 1] and outputs a single real 
value in [0, 1], where the output of a gate may be the sum/difference/max/min of two inputs, or a constant 
multiple of a single input, or no input and constant output. (An output value is set to 1 if for example two 
inputs that sum to more than 1 are input to a "sum" gate.) We also allow "comparator gates" in which the 
output of such a gate evaluates to 1 (respectively, 0) if its first input is greater (respectively, less) than the 
second input, and may take any value if they are equal. 

Notation 2 Let a = 2"2n_ i^t §^ = (a, 0, 0), 62 = (0, a, 0), 63 = (0, 0, a), 60 = {-a, -a, -a). 

Definition 4 We shall say that a Brouwer-mapping function f is implemented by an arithmetic circuit C if 

whenever f{Kiji.) = c, then C (x) — x = 5c when x is at the center ofKij^. For x not at a center, C (x) — x 
should be a convex combination of values of C{z) — zfor cubelet centers z within Loo distance 2~" ofx. 
Given T : K — > K computed by such a C, we shall similarly say that T implements /. 

Observation \ If T implements f, then any fixpoints of T must lie within distance 2~" of panchromatic 
vertices of f, and vice versa. 

Theorem 2 A Brouwer-mapping function having complexity parameter n can be implemented using a linear 
arithmetic circuit having poly[n) gates, that computes a continuous function. 

The proof gives a new technique to implement any Brouwer-mapping function / as a continuous function 
F that uses a linear arithmetic circuit. This is in contrast with the corresponding techniques of |4, 6 | that used 
a sampling-based approach in order to smooth the transition between distinct cubelets. The sampling-based 
approach results in discontinuous functions, where Browder's theorem would not be applicable (although 
it could still be applied to a continuous approximation). The technique only works in constant dimension; 
if can be extended to higher dimension using the "snake-embeddings" of f4l. (See Section |9]) The general 
idea of the technique is to take a simplicial decomposition of the domain K, give rules for obtaining the 
values of F at the vertices of the decomposition, and linearly interpolate within each simplex. 

Proof. Let / : /C^") {0, 1, 2, 3} be a Brouwer-mapping function. We construct a continuous Brouwer 
function F : K — > K computed by a linear arithmetic circuit C as follows. 

For x at the center of cubelet Kijk, set F{x) — x = 5c where c = f{Kijk). For x a vertex of cubelets 
ICx C /C("\ set F{x) — x to be the average of F{z) — z for all points z at the centers of members of /C^. 
The relevant points z can be obtained using a polynomial-sized piece of circuitry. 

Let 5 be a simplicial decomposition of the unit cube consisting of 12 simplices that share a vertex at the 
center of the cube, and all other vertices are vertices of the cube. Let Sijk be the simplicial decomposition 
of cubelet Kijk obtained by scaling S down to Kij^. Applied to all cubelets in /C*^"^ this results in a highly 
regular decomposition 5*^"^ of K into 12.2^" simplices. 
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For any x ^ K, F{x) is obtained by linearly interpolating between the vertices of the simplex in 5^") 
that contains x. Clearly F is continuous. 

The result follows from the following claim: 

Proposition 2 T as defined above, may be computed by a linear arithmetic circuit of size polynomial in n. 

Proof. If X is not a vertex of S^"'\ the circuit can determine the vertices of a simplex Sx G 5^"^ that contains 
X. There may be more than one such simplex, in which case it does not matter which is chosen. 

The circuit has 12 cases to consider, depending on the orientation of 5^:. Each case can be handled in 
the same general manner, by subtracting some vertex v of Sx from x, and multiplying (x — v) by some 
constants (the coefficients of the linear function that interpolated between the vertices of Sx)- Note that we 
never need to multiply two computed quantities together, multiplication only ever takes place between a 
computed quantity and a constant, as required for a linear arithmetic circuit. □ 

□ 

3.3 The PSPACE reduction to linear arithmetic circuits 

In this subsection, we establish the PSPACE-completeness of the problem Browder FIXPOINT, mentioned 
in the Introduction, which can now be made precise as follows. We use two bmfs /o and /i, where /o is 
the basic bmf of Definition |2j and /i shall be a DGP-style bmf that encodes an instance of End OF THE 
LINE as constructed in [6|. Let Tq and Ti be implementations of /o and /i using linear arithmetic circuits 
as described in the proof of Theorem [2] For T : K — > K let .F^*^ denote the z-th component of T. For 
i = 1,2, 3 let 

= max(min(J-«,^«),J-f ) ^ ' 

where in ([T]l, the outputs of operators + and — are restricted to lie in [0, 1] (so, rounding to or 1 if needed). 
Ft interpolates continuously between Fq and Fi and is constructed from them using elements of the linear 
arithmetic circuits of Definition [5] (which is useful later; the natural alternative Ft = tF^ + (1 — t)Fi does 
not have this property.) 

Observation 2 For all t £ [0, 1], F^^^ is Lipschitz continuous, with Lipschitz value < 2.2~". 

Fo has a unique fixpoint close to 2^"(1, 1, 1). Fq is a "basic Brouwer function" which forms the 
starting-point of homotopies we consider. Hence Observation [2] and Browder 's fixpoint theorem implicitly 
define a corresponding fixpoint of Fi. 

Define an approximate fixpoint of F : K — > to be a point x £ K with \F{x) — x\ < a/5 (recall 
a = 2-2"). 

Theorem 3 It is PSPACE-complete to find, within accuracy 2^", the coordinates of the fixpoint of Fi that 
corresponds to the homotopy o/Q. It is also PSPACE-complete to find the coordinates of an approximate 
fixpoint of Fi that would be obtained by following a sequence of approximate fixpoints of Ft in which 
consecutive points are within distance a of each other 

Proof. We reduce from the problem defined in Proposition [T] as follows. Let B he. n Brouwer-mapping 
circuit derived from OEOTL-instance (S", P) using Proposition [T]and let /b : /C(") ^ {0, 1, 2, 3} be the 
function computed by B. Let Fi : K — > K be the function computed by a linear arithmetic circuit that 
implements /s, and Fq be computed by a circuit that implements the basic bmf /o (where both implemen- 
tations apply Theorem [2]). Ft is given by ([T]|. 
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Let P be a connected subset of K x [0, 1] such that for any (x, t) G P, x is a fixpoint of Tt, and P 
contains xq G (^,0) and x\ G [K^ 1). Browder's fixpoint theorem (with Observation [2]) assures us that 
such a P exists. We claim that x\ is within distance of the unique solution to B of the problem specified 
in Proposition [T] (and hence, given x\ we can easily construct this solution). 

Suppose otherwise. For x\ to be a fixpoint (even an approximate one) of J^i, by Observation[T]it must be 
within distance 2~" of a panchromatic vertex v of fs- But now, v is not connected to the origin via non-zero 
cubelets of /b- By connectivity of P, there must exist (x, t) ^ P such that x lies within a cubelet where 
fB{K^) = 0. 

We may assume further that x is at least 2^^ distant from any non-zero cubelet of fs- This follows 
provided we assume that connected components of non-zero cubelets of fs are separated from each other 
by a layer of 0-colored cubelets of thickness at least 3. This may be safely assumed by increasing n by a 
factor of 3 and subdividing the cubelets. We note that 

1. each entry of vector Toi^) — x is < —a/5, and 

2. each entry of Ti{x) — x is < —a/5. 

It follows that for t G [0, 1], each entry of ft{x) — x is less than —a/5, since coordinatewise, fo < ft < Jb- 
That means that x cannot be an approximate fixpoint of any ft, contradicting the assumption as required. 

Since x is at least 2^" distant from any non-zero cubelet of fs, it is also at least 2^" distant from any 
non-zero cubelet of /o, since for any cubelet Kij}^, fB{Kijk) = =^ fo{Kijk) = 0. The implementation 
of any bmf / as a function F computed by a linear arithmetic circuit, as referred to in Theorem |2j ensures 
that F{x) — X is a convex combination of vectors F{z) — z for cubelet centers z in the vicinity of x, and 
since all those cubelet centers are colored 0, we have that the entries of T{x) are all less than —a/h, as 
required. □ 



4 The Linear Tracing Procedure 

We now turn to games and Nash equilibrium. Let Q denote an n x n game that we wish to solve, assumed 
to be chosen by an adversary. Qq is a game with a unique "obvious" solution. In Qq each player receives 
payoff 1 for his first action, and payoff for all others, regardless of what the other player does. 
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In the problem LINEAR TRACING the solution consists of the Nash equilibrium of Q that is connected 
to the unique equilibrium (sg, Sg) of Qq via equilibria of convex combinations (1 — t)QQ + tQ. We can also 
define an approximate version of this problem, where instances include an additional parameter e, and we 
seek an e-Nash equilibrium that is connected to the solution of Qq via a sequence of e-approximate solutions 
of Qt- For the two-player case we assume e = 0. For more than 2 players, we need a positive e to ensure 
that solutions can be written down as rational numbers. 

Theorem 4 Linear tracing is PSPACE-complete for 2-person games. 

The same result then holds for strategic-form games with more than 2 players. It holds for a value of 
e that is exponentially small; we could again use the ideas of to obtain a version where e is inverse 
polynomial. 
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Our reduction uses the result of the previous section, along with earlier reductions between strategic- 
form games and graphical games. Qq has a similar role to the basic Brouwer function Tq, but the correspon- 
dence is indirect; generally is associated with one of the "intermediate games" Qt for t > 0. 

4.1 Brief overview of the proof ideas 

The following is a brief overview of the rest of this Section |4] Membership of PSPACE can be deduced 
from 113]. The reduction from the PSPACE-complete discrete Brouwer fixpoint problem of the previous 
section, applies the idea from f6l of going via graphical games to normal-form games. We derive a type 
of graphical game in which a specific player (denoted VswUch) acts as a switch, allowing the remaining 
players to simulate either the basic Brouwer-mapping function, or one associated with an instance of the 
search for a discrete Brouwer fixpoint. VswUch governs this behavior via his choice of either one of two 
alternative strategies, and we show that a continuous path of equilibria from one choice to the other, results 
in an equilibrium that ultimately represents a solution to Oeotl. The graphical game is then encoded as a 
2-player game such that the linear-tracing procedure corresponds to this continuous path of equilibria in the 
graphical game. 

4.2 Membership of PSPACE 

Herings and van den Elzen ifTSl show how to find approximate equilibria on multi-player games, implicitly 
constructing a degree-2 graph that has a vertex corresponding to A/q, the Nash equilibrium of Qq. Given a 
simplicial decomposition of D x [0, 1] (where D is the space of mixed strategies of Q) vertices of the graph 
correspond to simplices and subsimplices, and edges are implicitly defined by a lexicographical pivoting 
rule that governs a choice of movement from simplex to adjacent simplex, at each step of the algorithm. It 
can be checked that this algorithm establishes membership of PSPACE for multiplayer Linear tracing. 

4.3 Graphical Games 

In a graphical game ifTTl . each player is a vertex of a graph, and his payoffs depend on his own and his 
neighbors' actions. For a low-degree graph, this is one way that games having many players may be repre- 
sented concisely. A homotopy between two graphical games QQq and GGi would require that these games 
have the same underlying graph, so that they differ only in their numerical payoffs. In the graphical games 
considered here, each player has just 2 actions and 3 neighbors. The main result of this section is 

Proposition 3 Consider graphical games that contain a special player v switch whose payoffs are constant 
( unaffected by his own actions or the other players '). The following problem is PSPACE-complete: find a 
Nash equilibrium of the game where v switch plays I, that is topologically connected to a Nash equilibrium 
in which v switch plays 0, via a path of Nash equilibria in which v switch plays mixed strategies. 

Let Fq and Fi be functions computed by linear arithmetic circuits that implement Brouwer-mapping 
functions /o and /i, where /o is the "basic bmf" of Definition |2j and /i is a DGP-style bmf that encodes 
some instance of End of the line. 

Notation 3 In a graphical game in which all players have 2 pure strategies denoted and 1, given a mixed- 
strategy profile for the players we let p[f ] denote the probability that player v plays 1. 

Definition 5 (Linear graphical game; simulation of bmf 's and partial bmf's) 

Given a bmf f, we construct an associated graphical game QQ f as follows. QQ f has 3 special players 
{vx,Vy, Vz) whose strategies ip[vx],p[vy],p[vz]) represent a point in K. If f is implemented by F : K — > 
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K we use gadgets of ^ to simulate the nodes in the arithmetic circuit that computes J- (each node of the 
circuit has an additional associated player in QQ f). The game can pay them to adjust (p[ux], pii'j/], P^z]) 
in the direction J~{jp\vx ], p[vj,], pii'z]) — (pI^^x], Pbj/]; pI^z])- Then the players (v^, Vy,Vz) are incentivized 
to play J-{p[vx],p[vy],p[vz]). Consequently a Nash equilibrium of QQ f corresponds to a fixpoint of T. 
Moreover, an e-Nash equilibrium corresponds to a poly (e) -approximate fixpoint of T. We call QQ f a 
linear graphical game since we only allow players whose payoffs cause them to simulate the gates of linear 
arithmetic circuits. 

A game of the above kind is said to simulate /. We say further that a game QQ simulates a partial bmf 
on a subset S of cubelets, if for any K ^ S, when {p[vx],p[vy], p[vz]) Us at the center of K the players 
{vx, Vy, Vz) are incentivized to play {p[vx], P^y], P^z]) + Sc, where c = f{K). 

Lemma 1 Given any linear graphical game QQi that simulates a Brouwer-mapping function fi, we can 
efficiently construct a new game QQ^ having a player Vswitch whose behavior can either cause QQ to 
simulate fi ( if v switch plays 1) or cause QQ to simulate /o if instead v switch plays 0. 

Vswitch shall serve as a "switch", in allowing the game to switch between simulating /o and /i (using 
an additional 3 players {v^ ,v:^,vf) whose strategies represent a point in K) according to whether v switch 
plays or 1 . Of course, v switch has a key role in the associated two-player game. 

Proof. For i G {0, 1}, let QQi be a graphical game constructed from fi according to Definitions |2j|4j|5] QQi 
has 3 players/vertices whose mixed strategies, as represented by the probabilities that they play 1 , represent 
a point in K. Denote these players (wj,, Vy,vl). 

Construct a "combined" game QQ^ as follows. QQ^ contains all the players in QQq and QQi together 
with a new player v switch^ where v switch has the same fixed payoff for playing either or 1. We add 3 players 
{Vx,Vy, vf) whose mixed strategies represent a point in K, and players {vx ,Vy ,vf), whose behavior is 
governed by 

P[^^^] = (P[^S] - P[Vswitch]) + {P[vi] - (1 - p[Vswitch])) 

p[u+]=max(p[w+], min(p[i;2], p[ui])) 

(and similar expressions for and vf) where the parentheses in the above expression are important since 
the outputs of the operators + and — are truncated to lie in [0, 1]. 

Players from QQq and QQi that take input from nodes vf or vj respectively, are then modified to take 
that input from instead. This completes the construction. □ 

Proof, of Proposition [3} We reduce from the circuit homotopy of Theorem[3] Let {Tt '■ ^ G [0, 1]} be an 
instance of this circuit homotopy. Construct Qi from J^i as per Definitionjs] Construct QQ^ as in Lemma[T] 
and we make the following observation. 

Observation 3 Suppose that in QQ^ we have p[vswitch] = ^ ^ (0, 1). The resulting game QQf simulates a 
partial Brouwer-mapping function ft which is implemented by a Brouwer function Tt that is (pointwise) a 
convex combination of and T\ and is defined on the subset of cubelets where /o = f\. Given a homotopy 
path of Nash equilibria ofQQ^ that start at the unique equilibrium ofQQ^ that satisfies p{vswitch] = and 
ends at an equilibrium of QQ^ in which p{vswitch\ = 1. there is a corresponding homotopy path from the 
fixpoint of and a fixpoint of Ti (noting that Q is essentially the same as (|7|j. 

That concludes the proof of Proposition |3] □ 
The following version of Lemma[T]is useful in the construction for Lemke-Howson solutions, later on. 
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Corollary 1 Given any linear graphical game QQi that simulates a Brouwer-mapping function fi, we can 
efficiently construct a new game QQ^ having 2 players Vgwitch cind v'^^^^^f^ whose behavior can either cause 
QQ^ to simulate fi (if both v switch, '^'switch P^^y 1) cr cause QQ^ to simulate /o if instead either or both 
play 0. 

Proof. The proof of the previous Lemma is modified as follows. We re-use Equation ^ for players 
{v^,v^, w^). The 3 players v ++, f ++) whose mixed strategies represent a point in K have behavior 

governed by 

p[vt+] = (p[^S]-pK™teJ) + (pb^]-(i-pK™tJ)) 

p[v++] = max(p[t;++],min(p[7;2],p[u+])) 
again with similar expressions for and vf^. □ 

4.4 From graphical to two-player strategic-form games 

In this subsection we prove the following theorem, from which Theorem[4]follows since we have previously 
noted membership of PSPACE. 

Theorem 5 It is PSPACE-hard to compute the Nash equilibrium of a given 2-player normal-form game Qi, 
that is obtained via the linear homotopy that starts from Qq, a version of Qi where the payoffs have been 
changed to give each player payoff 1 for his first strategy and Ofor the others. 

We reduce from the graphical game problem of Proposition [s] Let QQ^ be a linear graphical game that 
includes a player Vgwitch as per Proposition [s] First, modify QQ^ to give Vgwitch a small payment (say, 0.01) 
to play 1 , and zero to play 0. 

We define a homotopy between two-player strategic-form games Qq and Qi such that equilibria of Qi 
efficiently encode equilibria of QQ^, and equilibria of Qt encode equilibria of versions of QG^ where Vs^nch 
has a bias towards playing 0. We use the reduction of [6] (Section 6.1) from graphical games to 2-player 
games (a similar reduction is used in [4] (Section 7) to express generalized circuits (similar to our linear 
arithmetic circuits) as 2-player games). 

In the context of a mixed- strategy profile, let Pr[s] denote the probability allocated to pure strategy s by 
its player. 

Definition 6 A circuit-encoding 2-player game Q has a corresponding graphical game QQ where the graph 
of QQ is bipartite; denote it G = {Vi U V2, E); each player (vertex) in QQ has 2 actions (denote them 
and 1 ) and payoffs that depend on the behavior of 2 other players in the opposite side of G 's bipartition. 
Each vertex/action pair (v, a) of QQ has a corresponding strategy in Q; for v G Vi, (v, a) belongs to the 
row player and for f G V2, (f , a) belongs to the column player The payoffs in Q are designed to ensure that 
in a Nash equilibrium ofQ 

• Pr[(w, 0)] + Pr[(w, 1)] > l/2n where n is the number of players in QQ 

• if in QQ, v plays 1 with probability Pr[(f , l)]/(Pr[(?;, 0)] + Pr[(?;, 1)]) then we have a Nash equilib- 
rium ofQQ. 

Let C/ be a circuit-encoding game derived from QQ^ according to Definition[6] Associate Vswitch with 2 
strategies of of the column player of Q, and let and s^_,_^ be these strategies. Hence a Nash equilibrium of 
Q corresponds to one of QQ^ where the value p[wsTOtc7i] is given by the value Pr[s^^^]/ (Pr[s^] + Pr[s^^^]). 

Observation 4 If we take a circuit-encoding 2-player game, and award one of the players a small bonus to 
play (v, a), then this corresponds to incentivizing the player v in QQ to select strategy a. The corresponding 
incentive for v will be larger, but only polynomially larger 



12 



Let ^0 be a (n + 1) x (n + 1) game with strategies {sq, . . . sj^} for the row player, and {sg, . . . s^} for 
the column player. Payoffs are as follows: each player receives 1 for playing Sq or Sq, and for or Sj for 
j > 0. (Thus Qq is a (n + 1) X (n + 1) version of Equation (j2]l.) 

Rescale the payoffs of Q to all lie in the range [0- 9, 1- 1]. Let be a (n + 1) x (n + 1) game with 
strategies {sg, • • • , s^} for the row player, and {sq, . . . , s^} for the column player. Payoffs are as follows: 

• (^oi '^o) results in payoffs (0, —1) for the playersj^ 

• (sq, sp for j > results in payoffs (0, |). 

• (sj, Sg) for j > results in payoffs (— 1, |) for j 7^ A;, and (— 1, | + 5) (for 5 inverse polynomial in 
n) for j = k 

• The rest of Qi is a copy of Q above. 
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Let Qt = [1 — t)Qo + tQi. The above payoffs have been chosen so that Nash equilibria satisfy: in 
01, players do not use Sg or Sg; in Qq.q, players both have a proper mixture of Sg and Sg with their other 
strategies. Since ^'s payoffs were rescaled to lie in [0. 9, 1. 1], Pr[sg] and Pr[sQ] can be shown to lie in 
[0. 1, Q. 9], which can be checked from the following payoff ranges for Qos- 



Sq S^ . . . S, 



(0.4, -0.2) 


(0.4,0.45 + 5) 


(-0.6,0.85) 


([0.54,0.66], [0.54,0.66]) 



Thus a continuous path of equilibria should at some stage allocate gradually less and less probability to Sq 
and Sq as t increases. 

Observation 5 In any Nash equilibrium M ofQi, the players assign probability to Sq and Sq, and conse- 
quently N consists of a Nash equilibrium ofQ, restricting to strategies s^, Sj,forj, j' > 0. 

Since Qt = [1 — t)Qo + tQi, we can write Qt as 
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The general idea is as follows. Consider the Browder path of equilibria that begins from the unique 
equilibrium of ^g (where initially both players play Sq, Sg). As t increases, the players will start to use the 
other strategies. At that stage, consider the distribution of their mixed strategies restricted to , . . . , sj^ and 

^The two-component payoff vectors assign the first component to the row player and the second component to the column 
player. 
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si, . . . , s'f^. These distributions will constitute a Nash equilibrium of a version of Q in which the column 
player receives a small bonus for playing s^. As t increases to 1, the bonus decreases continuously to 0, 
and we recover Observation |5] Now, recall from Definition [6] that the way ||6l |4l reduce graphical games 
to two-player games, is to associate each player v in the graphical game with two strategies in the two- 
player game, both belonging to the same player. The division of probability between those two strategies 
represents the probability that v plays 1. Consider v switch now, corresponding to and s^^^. v switch is, in 
the graphical game, mildly incentivized to play 1 , but for f < 1 the 6 in the two-player game Qt pushes it 
the other way, towards 0. As a result, a Nash equilibrium of Qt may simulate a Nash equilibrium of QGf 
where p[vswitch] G (0, 1). As t increases and the contribution from 6 decreases, this process corresponds to 
raising p[v switch] continuously (but not monotonically) from to 1. 

Lemma 2 Let M be a Nash equilibrium ofQt in which Pr [sq] < 1 and Pr[sQ] < 1. Let V be the probability 
distributions over {s\, . . . , sj^} and {sf , . . . , s^} obtained by taking each value Pr[Sj] (for i G {r, c}, 
^ ^ j ^ n) and dividing it by 1 — Pr[sQ]. 

Then V is a Nash equilibrium of a version ofQ where the column player receives a bonus of 5 Pi[sq]/ (1 — 
PT[sl])forsl. 

Proof. In Nash equilibrium J\f, c's strategy Sq contributes the same quantity to each one of r's strategies 
s^, . . . , sl^. So the values Pr[s^], . . . , Pr[s^] must form a best response to c's mixed strategy from V. 

The column player receives a bonus 6t Pr[so] specific to s^, arising from the possibility that row player 
plays 0. He also receives an additional |t for all strategies Sj for j > 0, but that uniform bonus has no 
further effect on his preference amongst , . . . , s^. 

So in M, Pr [sf ] , . . . , Pr [s^] is a best response to a mixture of Q weighted by 1 — Pr [sq] and the proba- 
bility Pr[sQ] of a bonus 6 Pr[sQ] for playing s^. This is equivalent to a best response to a version of Q with 
a bonus of (5Pr[so]/(l — Pr[so]) for playing s^. □ 

Consider the path of equilibria connecting equilibrium Mq of Qq to equilibrium Mi of Qi. By Lemma[2] 
we can choose 6 such that in any equilibrium of ^o.5 we have Pr[s^^^] = 0. We also have that in any 
equilibrium of Qi, Pr[s^] = 0. Consider the longest suffix of the path for which t > 0.5 for all games 
Qt that appear in that suffix. The corresponding equilibria assign weight strictly less than 1 to Sq and sg, 
so Lemma [2] may be used to recover corresponding equilibria of versions of Q which in turn correspond to 
versions of QQ^ in which initially, v switch is incentivized to play 0, and finally, v switch is incentivized to 
play 1. 

5 From Linear Tracing to the homotopies of van den Elzen-Talman, Herings- 
van den Elzen, and Herings-Peeters 

In the previous section, we showed the PSPACE-completeness of finding the Nash equilibrium of a two- 
player game that is associated with a homotopy that uses a specific simple starting-game that is not derived 
from the game of interest. In the literature on homotopy methods, starting with Harsanyi 1 10], the starting- 
game is usually derived from the game of interest by positing a prior distribution over the players' pure 
strategies, and using a starting-game whose payoffs are the result of playing against this prior distribution. 
In this section, we extend the result of Section [4] to handle these starting-games and thus obtain results for 
the Herings-van den Elzen [13] and Herings-Peeters 1 14] algorithms, which use the same underlying homo- 
topy, and the van den Elzen-Talman [15] algorithm, which uses a different homotopy. All three algorithms 
have been shown under certain conditions to mimic the Harsayni-Selten linear tracing procedure. For each 
algorithm, we use the uniform distribution as the prior distribution, which is a natural choice. 
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The van den Elzen-Talman algorithm uses a homotopy based on a starting mixed-strategy profile v. 
Letting S be the set of mixed-strategy profiles, let be the set of convex combinations (1 — t){v} + iS. 
In the notation of [ 15|, the van den Elzen-Talman algorithm — restricted to the two-player case — uses the 
homotopy 

where for i = 1, 2, (cr) denotes the best responses of player i to mixed strategy a, restricted to 

Theorem 6 It is PSPACE-complete to compute equilibria that result from the above van den Elzen-Talman 
homotopy. 

Proof, (sketch) It can be checked that the algorithm uses polynomial space. For the hardness, we reduce 
from Linear tracing; consider a game Q for which we seek an equilibrium that results from starting with 
Qq of the form of (|2]). Suppose we take a game Q from Section|4]and give each player an additional strategy 
as follows. Let sj^ and be the new strategies, for the row and column player respectively. has a payoff 
of —10 for the row player, regardless of how the column player plays (thus, is dominated by all the other 
strategies). The payoffs to the column player are chosen in such a way that, if in fact the row player uses 
the uniform distribution over Sg, . . . , s^^, then the column player's payoffs will be 1 for Sq and for s^, for 
j > 0. The new strategy has a similar definition. Note that the new payoffs are at most n in absolute 
value. These new strategies ensure that we have the desired of Section|4j when we restrict each player to 
his first n strategies. Strategies sj^ and are not used in any Nash equilibrium of Qt, since they are strictly 
dominated for all t. 

We let V be the uniform distribution. In Qt, the row plays a mixture (1 — t)v + tal while the column 
player plays (1 — t)v + ta^, where (t[ and are mixed strategies whose support do not include s"^ and s^, 
so they constitute a Nash equilibrium of a version of Q in which there is a bonus to play Sq and Sq. This 
bonus drops continuously to zero, so it is equivalent to the linear-tracing homotopy. □ 

The algorithms of Herings-van den Elzen fT3] and Herings-Peeters fT4l are based on an identical ho- 
motopy and differ only in the numerical technique used to follow the homotopy path. We can show using 
essentially the same construction as above that is is PSPACE-hard to compute the equilibria found by these 
homotopies. To do so we can again construct a starting game by giving the row/column players new strate- 
gies and chosen to have low payoffs to the row (respectively, column) players, but whose payoffs 
to the opponent are chosen such that if either player played the uniform distribution, the opponent would 
receive a higher payoff for his first strategy (either Sq or Sq) than the others, which would all receive the 
same (lower) payoffs. 

6 From Linear Tracing to Lemke-Howson 

The Lemke-Howson (L-H) algorithm is an important and rich research subject in and by itself within Game 
Theory; for the purposes of this reduction, it is helpful to take a point of view that considers the L-H 
algorithm as a homotopy ifTSl . where an arbitrary strategy (the one whose label is dropped initially) is given 
a large "bonus" to be played, so that the unique equilibrium consists of that strategy together with its best 
response from the other player; the homotopy arises from reducing that bonus continuously to zero. 

Theorem 7 It is PSPACE-complete to find any of the solutions of a 2-player game that are constructed by 
the Lemke-Howson algorithm. 

The remainder of this section proves Theorem [Vj the hardness being established by a reduction from 
the graphical game problem of Proposition |3} extending the ideas of the reduction for LINEAR TRACING 
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(Theorems [4|[5]l. A new technical challenge here is that the choice of initially dropped label results in 2n 
alternative homotopy paths, and we must ensure that any of the (up to) 2n solution can encode the single 
solution to some instance of Linear tracing. 

Suppose that some strategy has been given this "L-H bonus", and a Browder path of Nash equilibria is 
obtained from reducing that bonus to zero. As before let t € [0, 1] be a parameter that denotes the distance 
from the starting game of the homotopy to the game of interest, so that 1 — t is a multiplicative weight for 
the bonus in intermediate games. Consider the Browder path. It is piecewise linear, a topologically well- 
behaved line. Let T G [0, 1] parameterize points along the Browder path — an equilibrium Mr is the one 
that is a fraction T of the distance along the path (starting at the version of the game with the L-H bonus). 
So, multiple values of T can correspond to the same value of t. Here we mostly focus on T rather than t. 

The following construction addresses the issue that an arbitrary strategy may receive the L-H bonus. We 
embed two copies of a circuit-encoding game Q (Definition [6]) into a game instance for the Lemke-Howson 
algorithm. At least one of those copies of Q will not contain the strategy that receives the L-H bonus. The 
L-H homotopy, restricted to that copy of Q, will simulate the homotopy of Section[4] 

In Figure [T] Q denotes a circuit-encoding n x n game (note the two copies) whose payoffs have been 
rescaled to lie in the interval [0- 4, 0-6]. Q is assumed to have an associated graphical game with two "switch" 
players v^^-^^f^, v'^^^^^f^ that affect the equilibria of Q according to Corollary [T] They will correspond to the 
first pair of each of ^'s players' strategies (sq, s'l) and (sg, s^) such that, 

• if both p[t's^jicft] ~ ^^'^ Pb^'switchl ~ ^' ^'^ equilibrium encodes a solution to an End OF THE LINE 
instance that is efficiently encoded by Q ; 

• if either p[vl^j^f.^f^] = or pf^g^jjc/i] = 0» ^ encodes the "basic" Brouwer-mapping function; 

• if we add a bonus to the row player for his first strategy Sq that is less than some threshold r, it will 
result in Pr[sQ] = and hence pbs^j^c/i] ~ ^^'^ similarly for the column player with respect to Sq 
and v1^^^^^. (We will see that such bonuses occur, and they decrease at T — > 1.) 

Notation. A, B, C, D and A' , B', C , D' denote sets of the players' strategies as shown in Figure[T] In the 
context of a mixed-strategy profile, Pr[C] denotes the probability that the column player uses C; Vt[A\ that 
he chooses an element of A, and so on. Let X{T) = Pr[C] + Pr[i:>] + Pr[C"] + Pr[L>'], a function of 
distance along the Browder path. We note the following facts 

• X(0) > 1 (if, say, a column player strategy receives the L-H bonus, then the row player will play 
some pure best response, either C" or D'; so Pr[C"] = 1 or Pr[D'] = 1.) 

• -'^(l) ^ ^ (shown in Lemma |4jl 

together with the key observation that X{T) is a continuous function of T, implying: 
Observation 6 For some T G [0, 1], X{T') = \, and for T > T, X{T) < |. 

Let Q be the copy of Q that does not contain the strategy that receives the L-H bonus. (If one of C, D, 
C or D' receive the L-H bonus, then Q may be either copy of Q.) 

For any X, at least one player p has an additional bonus at least X/4 to play Sq in Q (suppose for 
example Pr[C] + Pr[L'] > X/2 and p is the row player; Figure [T] awards additional e = 1 to p when C or 
D is played). But neither player's bonus exceeds X/2. As T increases from T' to 1, X{T) goes down from 
I to at most -jj. We will establish that when X{T) = \, Nt contains a solution to a "biased" version of Q 
where one of the players' first strategies (i.e. Sq or Sq) has an additional bonus (enough to ensure Pr[s]^] = 
and p[Usu)ite/i] ~ 0, in the case of the row player). Furthermore, when T = 1, we have that Mt contains a 
solution to Q, only with smaller biases. These biases are associated with "switch" strategies in the graphical 
game associated with Q. 
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Let T' be the largest value of T where X{T) is large enough that one of the bonuses sets Pr[sQ] = 1 in 
Q (for p G {r, c}). Between T' and T = 1 we pass through a continuum of equilibria where Pr[sQ] changes 
from 1 to 0; equivalently v['^^switch\ changes from to 1, and the resulting equilibrium at T = 1 corresponds 
to a solution to Oeotl. 

Lemma 3 Let Nt be a solution of Qt- If Q is the bottom right-hand copy of Q in Figure [7] then if the 
distributions over B and B' are normalised to 1, we have a Nash equilibrium of a game Q where the row 
player has an additional bonus o/e(Pr[C] + Pr[D])/ Pr[i?'] to play his first strategy Sq, and the column 
player has an additional bonus o/e(Pr[C"] + Pr[L''])/ Pr[i?] to play his first strategy Sq. 

By symmetry, a similar result also holds in the case that Q is the top right-hand copy of the Q. 

Proof. Payoffs to the row player are unaffected by the column player's distribution over A. Meanwhile, C 
and D lead to an additional bonus of e (weighted by the probability that C and D are used by the column 
player) for the row player to use the top row of B' . □ 

Lemma 4 At t = 1 (equivalently, T = 1) we have in any Nash equilibrium, that Pr[C] < Pr[D] < 
Pr[C"] < ^ and Pi[D'] < jj. Since M > 100 we have X(l) < ^. 



Proof, of Lemma [4] We give the proof that Pr[C] < jj-; by symmetry the other claims are similar. 

Assume for contradiction that Pr[C] > jj. We know that at least one strategy from AUB gives positive 
payoff, since the places where they can obtain a negative payoff are "equal and opposite". Indeed, it can be 
checked that the payoff to at least one member of A and B is at least ^ min{0- 4, e}. The (column player's) 
payoffs from C and D must sum to zero, so if Pr[C] > then Pr[Z)] = (D gets negative payoff and is 
a strictly worse response A or B.) A similar argument for the row player's payoffs establishes that one or 
both of C and D' gets zero probability. 

Given that Pr[C] > we can deduce that Pr[j4'] — due to being a worse response than B': C 
contributes at least ^ = 2 to payoff {B') — payoff {A'); A contributes a positive amount; D has zero 
probability so contributes nothing; B contributes > —0- 6. We noted above that Pr[C"] = or Pr[D'] = 
(or both). Consider two cases: 

Case 1: Pr[D'] = 0. Deleting strategies with probability zero, we are left with the following structure: 





A 


C 


B 


B' 


(1,0) 


([M,M + e],-M) 


([0-4, 0-6], [0-4, 0-6]) 


a 


(-M, [M,M + e]) 


(-M, M) 


(M, [-M, -M + e]) 



Comparing C with A, we need Pr[C"] = 1 to avoid A being a better response than C (which is supposed 
to have positive probability > -p=). If Pr[C"] = 1, B is a worse response than the others, but when the 
column player uses only A and C, C has much lower payoff than B' , contradicting assumption that Pr[(7'] 
is positive. This leaves us with Case 2: 

Case 2: Pr[C"] = 0. We get, after deleting zero-probability strategies. 





A 


C 


B 


B' 


(1,0) 


([M,M + e],-M) 


([0-4, 0-6], [0-4, 0-6]) 


D' 


(M, [-M,-M + e]) 


(M, -M) 


(-M, [M,M + e]) 



Here the contradiction is immediate since i? is a strictly better response than C, preventing Pr[C] > 
0. □ 
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Figure 1 : The game has 2 copies of n x n game Q embedded in the top-left and bottom-right regions, with 
payoff rescaled to [0- 4, 0- 6]. In the top-right and bottom-left regions are copies of a n x n game that give 
the column player a payoff of and the row player a payoff of 1 . 

Each of A, B, A', B' denotes a set of n strategies. C, D, C and D' are individual strategies. 
In the proofs we put M = 1000, e = 1. 
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Lemma 5 Assume that e < 1 in Figure\ljand that M > 100. Suppose that X {T) < i. ThenPr[A] > ^, 
Pt[B] > ^, Pt[A'] > ^, Ft[B'] > ^. 

Proof. We need to consider two cases in detail: case 1 assumes that an element of A' received the L-H 
bonus and case 2 assumes that C received the bonus. All other possibilities are essentially the same as these, 
by symmetry. 

Case 1. Suppose first that a strategy from A' has been given the L-H bonus, and we are at a Nash 
equilibrium where X = ^. 

First we prove that the row player strategies satisfy Pr[A'] > j^, Ft[B'] > ^. Since X < |, we have 
Pr[A'] + Pr[B'] > |. Suppose for a contradiction that Pr[B'] < ^, so that Pr[A'] > 0-65. Then (for 
e < I), C is the unique best response for the column player and hence Pi[C] = 1. This implies X > 1, 
contradicting the assumption that X < |. Similarly, if Pr[^'] < then Pr[B'] > 0- 65, then D is the 
column player's unique best response, hence Pr[Z)] = 1, again contradicting X < ^. 

Next we prove that the column player strategies satisfy Pr[A] > ^, Pr[B] > ^. Suppose for a 
contradiction that Pt[B] < j^, so that Pr[yl] > 0- 65. Then for the row player, D' is a better response 
than B' and C (and regarding A', some strategy from A' received the L-H bonus, so we do not claim A' 
is suboptimal). If B' is not a best response, so Pr[i?'] = 0, since X < ^ we have Pr[A'] > |. Then 
C is strictly better than A, contradicting Pt[A] > 0-65. Alternatively suppose that Pt[A] < j^, so that 
Pr [5] > 0- 65. Then C is a better response than B' and D', since payojf{C') > (0- 65 - 0- 35) A/ = 0- 3M; 
payojf{B') < \M since Pr[C] < 3 by assumption that X < \\ payojf{D') is negative. With D' and B' 
eliminated, C is a strictly better response than B, contradicting Pr[i?] > 0. 

Case 2. Suppose alternatively that it was strategy C that received the L-H bonus. 

We show first that the row player's strategies satisfy Pr[^'] > j^, Pt[B'] > j^. Suppose Pr[A'] < ^, 
so that Pr[B'] > 0- 65. D is a better response than A and B. But from that it follows that Pr[i:'] +Pr [C] = 1, 
contradicting X = ^. Suppose Pr[i?'] < ^, so that Pr[j4'] > 0- 65. C is a better response than A and B, 
so Pr[C] + Pr[L>] = 1 contradicting X = l- 

Next we show that the column player's strategies satisfy Pr[j4] > ^,Pr[i?] > ^. Suppose Pr[i?] < ^, 
so Pi[A] > 0- 65. D"s payoff is greater than 0- 3M while A' and B' have payoff at most jM + 1 so D' 
is a better response than A' and B', contradicting X < ^. Suppose Pr[yl] < j^, so Pv[B] > 0- 65. C" is a 
better response than A' and B' even ignoring the L-H bonus. □ 

At X{T) = J we have that at least one of Pr [C] , Pr [D] , Pr [C] , Pr [D'] is at least ^ , while at the end of 
the Browder path, we know that all these quantities are at most . We set the switch threshold probability 
to be somewhere between these, but we have to use lower bounds on Pr[yl], Pr[i?], Pr[y4'], Pr[i?'] at t = 1 
and upper bounds on these at X = ^ (as well as lower bounds on these at X = | to ensure that a Nash 
equilibrium of the "biased game" is being encoded). 

Finally, we need to show that there exists r such that the bonus from at least one switch strategy in Q 
changes continuously above r to below it, while the bonus for the other switch strategy ends up below r, 
thus initially, at least one value of pbg^jjcft] ^^'^ Pi'^'switchl zero, but at the end both evaluate to 1. This 
needs to take into account the variable amount of probability allocated to the strategies in G, since that 
affects the impact of the bonuses on Sq. 

For any T G [T', 1] the weight assigned by each player to ^'s strategies is at least ^ by Lemma [sj so 
that the bonus for player p to play Sg, falls by a larger factor than the probability that G is played. That 
means that r can indeed be chosen as required. 
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7 Discussion and Open Problems 



Should a more general result be obtainable? For example, perhaps it should be possible to identify general 
classes of "path-following algorithms" that include the ones we analyzed here, for which it is PSPACE- 
complete to compute their output. A potential obstacle is that such a general result may subsume the question 
of whether the 2-dimensional analogue of Oeotl is PSPACE-complete (i.e. consider the PPAD-complete 
problem 2d-Sperner [3J; suppose we ask for the trichromatic triangle identified in the proof of Spemer's 
Lemma.) In 2 dimensions, the gadget that is used to allow "edges" to cross each other, rearranges the 
structure of those edges, such that the corresponding solutions to End OF THE LINE are the same, but not 
the unique solution to Oeotl. Generally, there are many ways to modify the edges of a given {S, P)-graph 
so that the degree- 1 vertices are unchanged, but the structure of the graph is in other respects completely 
different. 

Von Stengel et al ll26l use a tracing procedure to solve extensive two-person games, and they obtain a 
normal form perfect equilibrium by starting from a completely mixed starting vector. What is the complexity 
of computing a normal form perfect solution using this (or other) methods? (They note ([26J, p. 707) that 
on strategic-form games this procedure mimics the linear tracing procedure of ifTTTl .) 
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8 Appendix: exponentially many changes of direction 

We give an outline of how to modify our first construction, so as to show that in following a sequence 
of approximate fixpoints of Brouwer functions, or equilibria of games, t may have to change direction 
exponentially many times, and furthermore, oscillate between values whose difference is bounded away 
from zero. 

Fq shall be the same as in Theorem [3] We construct a modified form of Fi, which we will call as 
follows. We identify two subsets of the cubelets Kijk, Ri and R2, defined as Ri = {Kijk : W < k < 12}, 



21 



and i?2 = {Kijk : 20 < A; < 22}. These subsets are "thin layers" of cubelets that are perpendicular to the 
z-axis. Now we define how J^J" behaves on points at the centers of cubelets. For x at the center of Kijk, 

if Kijk G Ri, Tfi^) -x = 10(-Fi(x) - x), 
if Kijk G R2, J^rix) -x = U^iix) - x), 
otherwise, TY^{x) = Fi{x). 

For points not at the centers of cubelets, shall interpolate between the values at the nearest cubelet 
centers, using the same general approach as Fi. Let J^" = (1 ~ *)-^o + tF"''. 

Let D be the set of cubelets Kijk for which fo{Kijk) 7^ /i {Kijk), so D is the region where fixpoints of 
J7" may exist. Let D~ be the connected component of D which contains the cubelet i^i,i,i, so is the 
region within which fixpoints of FJ^ will occur on the homotopy path. 

By construction, has the property that \ Ri (and similarly \ R2) has exponentially many 
connected components. This follows from the construction of O from which D is derived; D simulates 
a {S, P)-graph by, for each edge of the {S, P)-graph, including a long sequence of cubelets that passes 
through both Ri and R2. 

Now consider points in n Ri. The claim is that fixpoints x of FJ^ for which x G n Ri, satisfy 
t <:\, and that for fixpoints x of F^^ with x € n R2 we. have i > f ■ The general idea (in the first case; 
the second case is similar) is that F"^ will map points z in Ri to points z' for which at least one component of 
z' — zis greater than 10a. Hence for t > |, raises the value of this component (the positive contribution 
from FY^ exceeds the negative contribution from Fq) and prevents it from being a fixpoint. 

The homotopy path must pass through this long sequence of regions that require t > | or else t < |. 
Moreover, the two types of regions alternate, so we establish the following result: 

Theorem 8 For continuous functions defined using arithmetic circuits, the sequence of fixpoints along the 
path given by the linear homotopy (1 — t)FQ + tFi has exponentially many alternations of the value oft. 

We obtain the following corollary: 

Corollary 2 For graphical or two-player games, suppose Qq is a game that assigns each player a dominat- 
ing strategy, and Q is an arbitrary game. The linear tracing procedure for the homotopy (1 —t)QQ-\-tQ will, 
in the worst case, have exponentially many reversals oft. 

The corollary follows since, the way we represent Brouwer functions parameterized by t in terms of 
games parameterized by t, does not change the value of t. Since we did not change Fq, the associated game 
Go is the same "dominating strategy" game of Section]?] 

9 Appendix: polynomially small error 

We can use the machinery of Chen et al. (4\ so that when we talk about the hardness of finding an e-fixpoint, 
e is allowed to be inverse polynomial rather than inverse exponential. This is achieved by using the snake 
embeddings of L4J. 

Snake-embeddings A snake embedding reduces a low-dimensional Brouwer-mapping function having 2" 
of cubelets in each dimension, to a ©(n) dimensional bmf having 0(1) cubelets in each dimension, in such 
a way that panchromatic vertices of the high-dimensional bmf efficiently encode panchromatic vertices of 
the low-dimensional bmf. The reduction can be decomposed into a sequence of Q{n) iterations, in which 
at each iteration, the number of cubelets along some axis is reduced by a constant factor, and we acquire 
an additional axis having 0(1) cubelets (in [4| it is in fact 8 cubelets). (Intuitively, the space is folded a 
constant number of times and gains thickness along the new dimension.) 
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A specific snake-embedding, and some notation We consider a snake-embedding of a 3 -dimensional 
bmf /b of the type of Proposition [T] Initially the colors are {0, 1, 2, 3}; let a denote the new color at the 
i-th iteration and let s be the number of iterations required to reduce to 8 the number of cubelets along each 
axis. 

Let n = 4 + Cs be the dimension of the new snake-embedding. 

Definition 7 Let K" be the unit n-dimensional cube. Partition into cubelets where v G {0, 1 , . . . , 7}" 
represents a cubelet of edge length 1 /8. A Brouwer- mapping circuit maps each such cubelet to one of the 
colors {0, 1, 2...,n}. Again, a bmf should be polynomial-time computable, and map exterior cubelets to 
color i for cubelets whose i-th coordinate contains the first (when v contains a 0), otherwise color 0. 
Other types of bmf that correspond to Definition^are defined analogously. 

The high-dimensional bmf can be computed by a Brouwer-mapping circuit B' that is polynomial in the 
size of B. Let Jb' be the function computed. The challenge is to implement Jb' using an arithmetic circuit 
that is polynomial in the size of the circuit that computes the bmf, and computes a Lipschitz continuous 
function. The simplicial-decomposition technique of Theorem [2] no longer works when we move to non- 
constant dimension, since the number of simplices per cubelet is exponential in the dimension. 

Observation 7 After iteration i, we have a {i + 3) -dimensional bmf in which Ci becomes the "background 
color" corresponding to color in the original 3-dimensional instance. 

The cubelets having colors {0, 1, 2, 3, ci, . . . , Cj-i} are mapped to cubelets in the {i + 3) -dimensional 
instance in such a way as to have the same neighborhood structure, but with some duplication at the folds 
of the embedding. 

The continuous implementation Define Tb' '■ K'^ — ^ as follows. If x lies at the center of a cubelet 
(of length 1/8), letting J = fB'{x), Fb'{x) = x + (5j , where = (— a, — q, . . . , — a) if j is the background 
color Cs, otherwise 5j = (0, 0, . . . , 0, a, 0, 0, . . . , 0) where the position of the non-zero entry depends on the 
color j, and is chosen to satisfy the boundary conditions of a bmf. 

If X does not lie at the center of a cubelet, we claim that for each color j, we can efficiently compute the 
Loo distance from x to the closest center of a cubelet having color j, using a linear arithmetic circuit. Let 
dj{x) e [0, 1] be this distance. Let \j{x) = max(0, {^^ —dj{x))). Then define Tb'{x) = x + '^^j 
In that expression for Fb'{x), 5j is a constant vector, so we are not multiplying two computed quantities 
together (which is disallowed in a linear arithmetic circuit). 

Why it works If we are not within distance of a panchromatic vertex, then Xj = where j is one of 
the missing colors. However, there is some j' for which Xjt > jq. The choice of the vectors 6j ensure that 

\J='B'ix) -X\>^. 

Consequently any approximate fixpoint of Tb' is close to a panchromatic vertex of /b' . To show that 
it is close enough to permit that panchromatic vertex to be efficiently reconstructed from the coordinates of 
the fixpoint, it is easiest to assume that there are 24 rather than 8 cubelets along each axis, with the original 
cubelets having been divided into 27 smaller ones all having the same color. Then an approximate fixpoint 
can be assumed to lie within 315 of a panchromatic vertex. 

We also need to point out that the high-dimensional bmf /b' has a path of {0, 1, 2, 3, ci, . . . , c<j-i}- 
chromatic cubelets which simulates the path of {1, 2, 3}-chromatic cubelets in the bmf /b. Thus, the one 
obtained by following the path in Jb', encodes the one obtained by following the corresponding path in Jb. 

By way of a final remark, it is necessary for us to make a snake embedding of our 3D graph into higher 
dimension, rather than (as in iH) reduce from the 2D version of the problem [3 |. This is because the PPAD- 
completeness of 2D Sperner Pl is a reduction that alters the structure of the End OF THE LINE graph 
being encoded, and so would not (in an obvious way) apply in a reduction from Oeotl. 
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